Predicting Prokaryotic Ecological Niches Using Genome Sequence Analysis

Authors

  • Garret Suen
  • Barry S. Goldman
  • Roy D. Welch
Abstract

Automated DNA sequencing technology is so rapid that analysis has become the rate-limiting step. Hundreds of prokaryotic genome sequences are publicly available, with new genomes uploaded at the rate of approximately 20 per month. As a result, this growing body of genome sequences will include microorganisms not previously identified, isolated, or observed. We hypothesize that evolutionary pressure exerted by an ecological niche selects for a similar genetic repertoire in those prokaryotes that occupy the same niche, and that this is due to both vertical and horizontal transmission. To test this, we have developed a novel method to classify prokaryotes, by calculating their Pfam protein domain distributions and clustering them with all other sequenced prokaryotic species. Clusters of organisms are visualized in two dimensions as 'mountains' on a topological map. When compared to a phylogenetic map constructed using 16S rRNA, this map more accurately clusters prokaryotes according to functional and environmental attributes. We demonstrate the ability of this map, which we term a "niche map", to cluster according to ecological niche both quantitatively and qualitatively, and propose that this method be used to associate uncharacterized prokaryotes with their ecological niche as a means of predicting their functional role directly from their genome sequence.
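The classification idea in the abstract (a protein domain distribution per genome, then clustering genomes by how similar those distributions are) can be illustrated with a minimal sketch. The abstract does not specify the similarity measure or the clustering algorithm, so cosine similarity and a simple single-linkage threshold rule are assumptions used here as stand-ins, not the authors' method. The genome names, the domain labels (`domA` and so on), and the `threshold` value are hypothetical placeholders, not real Pfam accessions or published parameters.

```python
from collections import Counter
from math import sqrt


def domain_distribution(domain_hits):
    """Convert a list of domain hits for one genome into relative frequencies."""
    counts = Counter(domain_hits)
    total = sum(counts.values())
    return {domain: count / total for domain, count in counts.items()}


def cosine_similarity(p, q):
    """Cosine similarity between two sparse frequency dictionaries."""
    dot = sum(p[d] * q[d] for d in set(p) & set(q))
    norm_p = sqrt(sum(v * v for v in p.values()))
    norm_q = sqrt(sum(v * v for v in q.values()))
    if norm_p == 0 or norm_q == 0:
        return 0.0
    return dot / (norm_p * norm_q)


def cluster_genomes(genomes, threshold=0.8):
    """Group genomes whose domain distributions are similar.

    Assumption: two genomes are linked when their cosine similarity is at
    least `threshold`, and clusters are the connected components of those
    links (single linkage). This is an illustrative stand-in only.
    """
    dists = {name: domain_distribution(hits) for name, hits in genomes.items()}
    names = sorted(dists)
    parent = {name: name for name in names}

    def find(name):
        # Union-find lookup with path compression.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if cosine_similarity(dists[a], dists[b]) >= threshold:
                parent[find(a)] = find(b)

    clusters = {}
    for name in names:
        clusters.setdefault(find(name), []).append(name)
    return sorted(clusters.values())


# Hypothetical toy input: genome name -> list of domain labels found in it.
genomes = {
    "soil_1": ["domA", "domA", "domB", "domC"],
    "soil_2": ["domA", "domB", "domB", "domC"],
    "gut_1": ["domD", "domD", "domE", "domA"],
    "gut_2": ["domD", "domD", "domE"],
}

if __name__ == "__main__":
    for cluster in cluster_genomes(genomes):
        print(cluster)
```

In this toy input the two "soil" genomes share most of their domain content, as do the two "gut" genomes, so the sketch groups them by shared repertoire rather than by any phylogenetic marker. A real analysis would start from Pfam annotations for each genome and would need a principled choice of distance and clustering method.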

Related resources

Fast identification of gene clusters in prokaryotic genomes

The detection of gene clusters that are conserved in several genomes, in terms of gene proximity and gene content, has proved to be an invaluable tool in the comparative analysis of prokaryotic genomes. It has applications, for example, in predicting functional association between groups of genes or putative genome rearrangements. We propose an efficient algorithm for computing gene clusters, ...

Predicting Ecological Roles in the Rhizosphere Using Metabolome and Transportome Modeling

The ability to obtain complete genome sequences from bacteria in environmental samples, such as soil samples from the rhizosphere, has highlighted the microbial diversity and complexity of environmental communities. However, new algorithms to analyze genome sequence information in the context of community structure are needed to enhance our understanding of the specific ecological roles of thes...

A Flood of Microbial Genomes–Do We Need More?

Complete genome sequences of important bacterial pathogens and industrial organisms hold significant consequences and opportunities for human health, industry and the environment. Addressing biological and clinical problems through genome sequence based approaches offers many commercial opportunities. The aftermath of whole genome sequencing has revealed new insights into evolution of bacterial...

Marker genes that are less conserved in their sequences are useful for predicting genome-wide similarity levels between closely related prokaryotic strains.

BACKGROUND: The 16S rRNA gene is so far the most widely used marker for taxonomical classification and separation of prokaryotes. Since it is universally conserved among prokaryotes, it is possible to use this gene to classify a broad range of prokaryotic organisms. At the same time, it has often been noted that the 16S rRNA gene is too conserved to separate between prokaryotes at finer taxonomi...

Pre_GI: a global map of ontological links between horizontally transferred genomic islands in bacterial and archaeal genomes

The Predicted Genomic Islands database (Pre_GI) is a comprehensive repository of prokaryotic genomic islands (islands, GIs) freely accessible at http://pregi.bi.up.ac.za/index.php. Pre_GI, Version 2015, catalogues 26 744 islands identified in 2407 bacterial/archaeal chromosomes and plasmids. It provides an easy-to-use interface which allows users the ability to query against the database with a...

Journal:
  • PLoS ONE

Volume 2

Publication date: 2007